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The attributable risk, often called the population attributable risk, is in 
many epidemiological contexts a more relevant measure of exposure-disease 
association than the excess risk, relative risk, or odds ratio. When estimat- 
ing attributable risk with case-control data and a rare disease, we present a 
simple correction to the standard approach making it essentially unbiased, 
and also less noisy. As with analogous corrections given in Jewell (1986) for 
other measures of association, the adjustment often won't make a substantial 
difference unless the sample size is very small or point estimates are desired 
within fine strata, but we discuss the possible utility for applications. 
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1 Attributable risk 



Although an exposure may be strongly associated with a disease, an inter- 
vention removing the risk factor can produce a limited public health benefit 
if the exposure is uncommon. Herein lies the interpretational problem with 
the measure of exposure-disease association 

RR = relative risk = pr(disease|exposed)/pr(disease|unexposed). 

A different measure of association is Levin's (1953) attributable risk 

AR = {pr (disease) — pr (disease | unexposed) } /pr (disease) 
= pr (exposed | disease) (1 — 1/RR) , 

where equality of the two lines can be seen from (5.1) of Jewell (2003). The 
attributable risk is informally the proportion of cases that could be saved 
by eliminating exposure, assuming no confounding. For instance, a value 
of 0.1 suggests a tenth of cases are due to exposure. When contemplating 
interventions, such knowledge can be more relevant than the relative risk. 

While the attributable risk is not directly identifiable with case-control 
sampling, for a rare disease it is approximately 

AR* = pr(exposed|disease) (1 — 1 /OR) , 

for 

^ pr(exposedldisease) prfunexposedl disease free) 

° R = ° ddS rati ° = p'unlposedlcfeease) V^pldlcLse fa./ - 

The reasoning behind the approximation is that 

AR* AR P r ( ex P ose d|disease) ( ^ pr(disease free|exposed) 



RR \ pr(disease free (unexposed) 

and the factor on the right will be close to zero if the disease is uncommon for 
both the exposed and unexposed. Case-control studies are most appropriate 
for rare diseases, so the following discussion assumes AR* AR, and we 
henceforth take AR* as our desired measure of association. 



2 



In a case-control study, we assess exposure status for m cases and n 
controls. The statistical model is 



number of cases exposed = A ~ Bin(m, q), < q < 1 
number of controls exposed = B ~ Bin(n,p), < p < 1, ALB 
number of cases unexposed = C = m — A 
number of controls unexposed = D = n — B. 

We can now formally define our parameter of interest as 

AR* = q(l- 1 ~ q P 



q 1-p 

2 Small sample correction 

For additional details we refer to Whittemore (1982) or Jewell (2003, section 
7.4), but the usual maximum likelihood estimator is given by 

ar = a bc 



A + C (A + C)D 

There is ambiguity when D = 0, and if the definition is taken literally the es- 
timator has bias of — oo. Since A + C = m, this part of the two denominators 
does not cause problems. We propose the small sample correction 

AR SS A 



A + C (A + C)(D + iy 

The following theorem is proven in the appendix, and states that bias de- 
creases exponentially with the number of controls. 

Theorem 1. E[AR SS ] ~ AK* = (±=£) p n+1 = 0( e - n ). 

For problems in statistics where exact unbiasedness is impossible, ex- 
ponentially decreasing bias is somewhat unusual. A Taylor expansion can 
frequently show ratios of empirical means and the like have bias f2(n _1 ), 
where n is the sample size. The jackknife was originally introduced to reduce 
such biases to 0(n~ 2 ) (Quenouille, 1956), which is a far cry from 0(e~ n ). 
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Ideally we would only investigate exposures that are associated with dis- 
ease, or at least neutral. In such a setting, the (1 — q)/(l — p) constant in 
the bias is no larger than one. 

In line with its interpretation of yielding the percentage of cases due to 
exposure, attributable risk is often reported to two decimal places. Hence, 
say our estimator is essentially unbiased if bias is less than 0.005. If the 
exposure is not ubiquitous, so less than half of controls are expected to be 
exposed, then only nine controls are needed to ensure essential unbiasedness. 

Table 1 compares the bias of the standard and corrected estimator. As 
mentioned, the standard estimator does not even have finite bias, but we 
artificially level the playing field by setting AR = AR S s when D — 0. The 
exact bias of the standard estimator can then be calculated by enumerating 
outcomes for the binomial distribution. Results are shown for a handful of 
parameter values (q,p) across the unit square, for a small study with ten 
cases and ten controls. The corrected estimator clearly seems preferable. 

Table 1: Bias of the standardjmd corrected_estimator, with m — 10 cases 
and n — 10 controls, where AR is set to AR S s if D — 0. The corrected 
estimator appears to have smaller bias. 



Q 


p 


AR* 


Bias(AR) 


Bias (AR SS ) 


0.20 


0.20 


0.0000 


-0.0297 


0.0000 




0.40 


-0.3333 


-0.1216 


0.0001 




0.60 


-1.0000 


-0.4647 


0.0073 




0.80 


-3.0000 


-1.0640 


0.3436 


0.40 


0.20 


0.2500 


-0.0223 


0.0000 




0.40 


0.0000 


-0.0912 


0.0000 




0.60 


-0.5000 


-0.3485 


0.0054 




0.80 


-2.0000 


-0.7980 


0.2577 


0.60 


0.20 


0.5000 


-0.0149 


0.0000 




0.40 


0.3333 


-.0608 


0.0000 




0.60 


0.0000 


-0.2324 


0.0036 




0.80 


-1.0000 


-0.5320 


0.1718 


0.80 


0.20 


0.7500 


-.0074 


0.0000 




0.40 


0.6667 


-.0304 


0.0000 




0.60 


0.5000 


-0.1162 


0.0018 




0.80 


0.0000 


-0.2660 


0.0860 
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The bias reduction would be a Pyrrhic victory if it induced a large spike 
in variance. Fortunately, this does not occur because the correction tends 
to stabilize things. The standard method has infinite variance since D = 
can appear in the denominator, but even after conditioning on the estimator 
being properly defined, the small sample correction still improves variance. 
The following theorem makes this precise, and our proof is in the appendix. 

Theorem 2. var(ARss\D ^ 0) < var(AR)\D ^ 0). 

The correction is thus meant to reduce both bias and variance, the com- 
ponents of mean squared error, or the traditional measure of risk in statistical 
decision theory. Our problem is a counterexample to the common situation 
where bias correction is ill-advised since the variance increase is prohibitive 
(Doss and Sethuraman, 1989; Efron and Tibshirani, 1993, section 10.6). 

3 Does the correction make a difference? 

Our method has several theoretical advantages that we have discussed, it can 
be implemented by hand, and we avoid definitional problems when D — 0. 
The procedure could prove useful with a small sample size, or when con- 
founding necessitates cross-tabulation and point estimates are desired within 
relatively fine strata. However, replacing D by D + 1 in the estimator will of- 
ten result in a very minor perturbation, and in this sense our proposal brings 
to mind the longstanding but frequently meaningless debate over whether to 
multiply a sample variance by n/(n — 1). 

To see the utility of our method, we should note that it is an analog of 
relative risk and odds ratio corrections of Jewell (1986). Adjustments follow 
from the fact that if X ~ Bin(n,p), then (n + l)/(X + 1) and X/(n-X + 1) 
are corrected estimators of 1/p and pj (1—p). Jewell (2003, section 7.1) states 
that the "principal value of a small sample adjustment is in its alerting us to 
situations where the sample size is small enough to have a noticeable impact 
on an estimator, thereby suggesting that large sample approximations may 
be suspicious." This seems like good advice for attributable risk estimation, 
for which our method should have a modest but positive impact. 
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Appendix 



Proof of Theorem 1. We begin by noting that since B ~ Bin(n,p), we have 



E 



B 



D+l 



= E 

n 

= E 



B 



k=0 
n 



= E 



n-B + 1 
n\ 

k\{n-k)\ 
n\ 
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n—k 



n—k 
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k 
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n-(fc-l) 



1 - v ^ Hi 



n-l 



p * — ' i\\n — i)\ 



P 



1-p 

P 



(1 -pr(£ = n)) 



P P n 



1 — p 1 — p 



1 — p 1 — p 

Further, E[A/(A + C)] = E[A/m] = q since A ~ Bin(m, q). Likewise, 
E[C/{A + B)] = 1 - q. By the independence of (A, C) and (B,D), we 
conclude that 



E[AR SS ] = E 
= E 



' A ' 




— E 


A + C_ 




' A ' 






-E 


A + C_ 



C B 
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The desired result now follows after expressing 

l-q p 



AR* = q(l 



q 1-p 



q-(l-q) 



P 



1-p 



□ 
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Proof of Theorem 2. We first write 

A — m ( n — D n — D 



AR = AR SS + 



m 



D 



D + l 



yielding 



vax(AR\D + 0) = var(AR ss \D ^ 0) 

A — m ( n — D n — D 

+ var 



+ 2 cov 



m 



AR 



A — m ( n — D n — D 



ss, 



m 



D 



D + l 



Both arguments of the covariance are increasing in A and D. This implies 
the covariance term is positive, and hence the desired inequality. □ 
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